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Abstract. The state complexity of a regular language is the number of states in a minimal 
deterministic finite automaton accepting the language. The syntactic complexity of a regular 
language is the cardinality of its syntactic semigroup. The syntactic complexity of a subclass 
of regular languages is the worst-case syntactic complexity taken as a function of the state 
complexity n of languages in that class. We prove that n’^~^ + n — 1, and + {n — 

2)2"'“^ -\- 1 are tight upper bounds on the syntactic complexities of right ideals and prefix- 
closed languages, left ideals and sufRx-closed languages, and two-sided ideals and factor-closed 
languages, respectively. Moreover, we show that the transition semigroups meeting the upper 
bounds for all three types of ideals are unique, and the numbers of generators (4, 5, and 6, 
respectively) cannot be reduced. 
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1 Introduction 

Formal definitions of the concepts introduced in this section are given in Section For now we 
assume that the reader is familiar with basic properties of regular languages and finite automata 
as covered in |24I29] . for example. 

There are two fundamental congruence relations in the theory of regular languages: the Nerode 
(right) congruence [52], and the Myhill congruence [51]. In both cases, a language is regular if and 
only if it is a union of congruence classes of a congruence of finite index. The Nerode congruence leads 
to the definitions of left quotients of a language and the minimal deterministic finite automaton 
(DFA) recognizing the language, and the Myhill congruence, to the definitions of the syntactic 
semigroup of the language. 

The state complexity of a language is the number of states in a minimal DFA recognizing the 
language. This concept has been studied extensively; for surveys and references see HEo]. The 
syntactic complexity of a regular language is the cardinality of its syntactic semigroup, which is 
isomorphic to the transition semigroup of a minimal DFA recognizing the language, where the 
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transition semigroup is the semigroup of transformations of the set of states of the DFA induced 
by non-empty words. 

Syntactic complexity does not refine state complexity, for there exist languages with the same 
syntactic complexity but different state complexities. However, it ofter helps to distinguish among 
languages with the same state complexity. For example, the DFAs in Fig. [T] all have the same 
alphabet, are all minimal, and all have state complexity three. However, the syntactic complexity 
of Vi is 3, that of 2?2 is 9, and that of T)^ is 27. 


a 


c a 


b, c 


a, b 


c 


c 




Fig. 1. DFAs with various syntactic complexities. 


The problem we study in this paper is the following: Given a language belonging to a subclass of 
the class of regular languages - for example, the subclass of finite languages or prefix-free languages 
(prefix-codes) - what is the maximal size of the syntactic semigroup of that language? Equivalently, 
given a minimal DFA of a language in the subclass, what is the maximal size of the transition 
semigroup of the DFA? A secondary problem is to find a minimal set of generators for the maximal 
semigroup. 

Syntactic complexity has been studied in several subclasses of regular languages other than 
ideals: prefix-, suffix-, bifix-, and factor-free languages mu-, star-free languages [SH; R- and J- 
trivial languages [5]; finite/cofinite and reverse definite languages [B]. This problem can be quite 
challenging, depending on the subclass; in the present case it is easy for right ideals but much more 
difficult for left- and two-sided ideals (defined below). 

As syntactic complexity bounds the maximal size of the transition semigroup, it provides a 
natural bound on the time and space complexities of algorithms dealing with transition semigroups. 
For example, a simple algorithm determining whether the language of a given DFA is star-free [20] 
- meaning that it can be generated from finite languages by using only Boolean operations and 
product (concatenation), but not star; equivalently, its syntactic semigroup is group-free, that is, 
has no non-trivial subgroups - requires the enumeration of all transformations and checking whether 
they do not contain non-trivial cycles. 

Maximal transition semigroups also play an important role in the study of most complex lan¬ 
guages [2] belonging to a given subclass. These are languages that meet all the upper bounds on 
the state complexities of Boolean operations, product, star, and reversal, have maximal syntactic 
semigroups and most complex atoms |12j . 
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In contrast to the syntactic monoid of the language, the syntactic semigroup may or may not 
contain the neutral element (the identity transformation). The presence of letters acting as identity 
is often important in the case of state complexity of binary operations. Moreover, the syntactic 
semigroup is more suitable to characterize some classes of languages, which have a description in 
terms of semigroups. For example, in the class of (co)finite languages all transformations must 
admit a certain linear order of the states and the identity transformation cannot be present; 
the latter condition would not be distinguished by the syntactic monoid. 

In this paper we study the syntactic complexities of right ideals (satisfying the equation L = 
LS*), left ideals (satisfying L = S*L), and two-sided ideals (satisfying L = E*LS*). Ideals are 
fundamental objects in semigroup theory. They appear in the theoretical computer science literature 
in 1965 [23] and continue to be of interest. Ideal languages are special cases of convex languages 
(see e.g. i), and they are complements of prefix-, suffix-, factor-, and subword-closed languages. 
Besides being of theoretical interest, ideals also play a role in algorithms for pattern matching. For 
this application, a text is represented by a word w over some alphabet E. A pattern is language L 
over E. An occurrence of a pattern represented by L in text w is a triple (u, x, v) such that w = uxv 
and a: is in L. Searching text w for words in L is equivalent to looking for prefixes of w that belong 
to the language E*L, which is the left ideal generated by L, or looking for factors of w that belong 
to E*LE*. 

The state complexity of operations on the classes of ideal languages was studied by Brzozowski, 
Jiraskova and Li [3]. The same problem for the classes of prefix-, suffix-, factor-, and subword- 
closed languages was studied by Han and K. Salomaa [E], Han, K. Salomaa, and Wood [T3], and 
Brzozowski, Jiraskova and Zou [4]. We refer the reader to these papers for a discussion of past work 
on this topic and additional references. 

The set of all n" transformations of a set of n elements is a monoid under composition of 
transformations, with identity as the unit element. In 1970, Maslov [19] dealt with the generators 
of the semigroup of all transformations in the setting of finite automata. Holzer and Konig m, and 
independently Krawetz, Lawrence, and Shallit [18] studied the syntactic complexity of unary and 
binary regular languages. Recently, syntactic complexity has been studied in several subclasses of 
regular languages other than ideals: prefix-, suffix-, bifix-, and factor-free languages mni; star-free 
languages |6I9| : R- and J-trivial languages [5]. 

We define our terminology and notation in Section [2 and give some basic properties of syntactic 
complexity in Section [3] The syntactic complexities of right, left, and two-sided ideals are treated 
in Sections HMl and Section [3 concludes the paper. As mentioned above, closed languages are 
complements of ideal languages. Since syntactic complexity is preserved under complementation, 
our proofs are for ideals only. The syntactic complexity of all-sided ideals remains open. 

In the proof for the upper bounds for left and two-sided ideals we use the method of injective 
function, which is generally applicable for other subclasses of regular languages (see m for suffix- 
free and [55] for bifix-free languages). The proofs presented here are the first that apply this method 
to syntactic complexity. 

A part of the results in this paper previously appeared in conference proceedings: In 2011 in |13j 
syntactic complexity of right ideals was established and lower bounds for the classes of left and 
two-sided ideals were presented. In 2014 in [10] incomplete proofs of the upper bounds for syntactic 
complexity of left and two-sided ideals were presented. 
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2 Preliminaries 

If Z" is a non-empty finite alphabet, then S* is the free monoid generated by S, and is the free 
semigroup generated by S. A word is any element of S*, and the empty word is s. The length of 
a word w £ E* is |r(;|. A language over S is any subset of E*. 

If re = uxv for some u,v,x £ E*, then m is a prefix of w, v is a. suffix of w, and x is a faetor 
of w. A prefix or suffix of w is also a factor of w. If re = U 1 V 1 U 2 V 2 ■ ■ ■ UkVkUk+h where the Ui and 
Vi are in E*, then ViV 2 - ■ ■ is a subword of w. A language L is prefix-closed ii w G L implies 
that every prefix of w is also in L. In an analogous way, we define suffix-closed, factor-closed, and 
subword-closed. We refer to all four types as closed languages. 

The shuffle uluv of two words u,v G E* is defined as follows: 

MLU u = {uiVi ■ ■ -UkVk \ u = Ui- ■ ■Uk,v = Vi ■ ■ ■Vk,ui,... ,Uk,vi,... ,Vk G Z*}. 

The shuffle of two languages K and L is defined by 

K lu L = (J MLut;. 

uGK,vGL 

A language L C Z* is a right ideal (respectively, left ideal, two-sided ideal, all-sided ideal) if it 
is non-empty and satisfies L = LE* (respectively, L = E*L, L = E*LE*, L = E* lu L). We refer 
to all four of these types as ideal languages or simply ideals. 

A transformation of a set Qn of n elements is a mapping of Qn into itself, whereas a permutation 
of Qn is a mapping of Qn onto itself. In this paper we consider only transformations of finite sets, 
and we assume without loss of generality that Qn = {0,l,...,n — 1}. An arbitrary transformation 
has the form 

0 — 2 n —1 

qoqi ■■■ qn -2 qn-i 

where qk G Qn for 0 ^ A: ^ n—1. The image of element q under transformation t is denoted by qt. The 
identity transformation 1 maps each element to itself. For fc ^ 2, a transformation (permutation) s of 
a set P = {po,pi,... ,Pk-i} C is a k-cycle if pos = pi,pis = p 2 ,.. ■ ,Pk- 2 S = pk-i,Pk-is = Pa¬ 
li a transformation t on Qn acts on P C like a fc-cycle then t is said to have a k-cycle. A fc-cycle 
is denoted by {po,pi,... ,pk-i) when it is viewed as a transformation of P. If t is a transformation 
of Qn, has a A:-cycle {pa,pi,. ■. ,Pk-i) of P, and acts as identity on Qn \ P, then we denote t also 
by {pa,pi,... ,pk-i). A 2-cycle (po^Pi) is called a transposition. A transformation is constant if it 
maps all states to a single state g; it is denoted by (Q —t 9 ). A transformation that maps a single 
state p to g and keeps (5\ {p} unchanged is denoted by (p —>■ g). If ic is a word of Z*, the fact that w 
induces transformation t is denoted hy w: t. A transformation mapping p to qp for p = 0,..., n — I 
is sometimes denoted by [go, ■ ■ ■, gn-i]- 

The following facts are well-known [25127]: 

Proposition 1. The complete transformation monoid Tn of size n" can be generated by any cyclic 
permutation of n elements together with a transposition and a singular (non-invertible) transforma¬ 
tion r = (n— 1 —>■ 0) of rank (image size) n—1. In particular, Tn can be generated by {D,l,... ,n—l), 
(0,1) and (n — 1 —>■ 0). Moreover, Tn cannot be generated by fewer than three generators for n ^ 3. 

The left quotient, or simply quotient, of a language P by a word w is the language w~^L = 
{a: £ Z* I wx G L}. An equivalence relation ^ on Z* is a left congruence if, for all x,y G E*, 
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X ^ y ux ^ uy, for all u S E*. It is a right congruence if, for all x^y € E*, x ^ y xv ^ 
yv, for all v G E*. It is a congruence if it is both a left and a right congruence. Equivalently, ^ is 
a congruence \i x ^ y uxv ^ uyv, for all u,v G E*. 

For any language L Q E*, define the Nerode (right) congruence [52] '^l of L by 

x^lV if and only ii xv G L ^ yv G L, for all v G E*. (1) 

Evidently, x~^L = y~^L if and only if x^^y. Thus, each equivalence class of this congruence 
corresponds to a distinct quotient of L. Let K = {Kq, ... be the set of quotients of a 

regular language L; by convention, we let Kq = L = e~^L. The number of distinct quotients of L 
is the quotient complexity k{L) of L. 

The Myhill congruence [21j of L is defined by 

x^lV if and only if uxv G L uyv G L for all u,v G E*. (2) 

This congruence is also known as the syntactic congruence of L. The semigroup E~^ of equiva¬ 
lence classes of the relation is the syntactic semigroup of L, and E* is the syntactic monoid 
of L. The syntactic complexity (j(L) of L is the cardinality of its syntactic semigroup. 

A deterministic finite automaton (DFA) is a quintuple V = {Q, E,6,qo, F), where Q is a finite, 
non-empty set of states, A is a finite non-empty alphabet, S: Q x E ^ Q is the transition function, 
qo G Q is the initial state, and F Q Q is the set of final states. By the language of a state q oiT) we 
mean the language Kq accepted by the automaton {Q,E,5,q,F). States p and q are equivalent if 
Kp = Kq. A state q is reachable if S(qo,w) = q for some w G E*. A DFA is minimal if every state 
is reachable and no two states are equivalent. 

The quotient automaton of L is 2? = {K, E,S, L, F), where 6{Kq,a) = a~^Kq, and F = {Kq \ 
e G Kq}. The quotient automaton is always minimal, and so quotient complexity is the same as 
state complexity. 

The transition semigroup of a DFA is the set of transformations induced by words of E~^ on 
the set of states. The transition semigroup of the quotient DFA of L is isomorphic to the syntactic 
semigroup of L [5S] . 

3 Syntactic Complexity of Languages with Special Quotients 

We now present some basic properties of syntactic complexity. 

Proposition 2. For any L C E* with k{L) = n>l,n — ^ «.”■ 

Proof. Let V = {K, E,6, L, F) be the quotient automaton of L. Since every state other than L 
has to be reachable from the initial state L by a non-empty word, there must be at least n — 1 
transformations. If A = {a} and L = then k{L) = n, and cr(L) = n — 1; so the lower bound 

n — 1 is achievable. The upper bound is n", and by Proposition [T] this upper bound is achievable if 
1571 >3. 

If one of the quotients of L is 0 (respectively, {e}, E*, A+l, then we say that L has 0 (respectively, 
{e}, E*, 57+). A quotient w~^L of a language L is uniquely reachable [T] if x~^L = w~^L implies 
that X = w.li {wa)~^L is uniquely reachable for a G E, then so is w~^L. Thus, if L has a uniquely 
reachable quotient, then L itself is uniquely reachable by e, i.e., a minimal automaton of L is 
non-returning m- 
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Theorem 3 (Special Quotients). Let L C S* and let k{L) = n ^ 1. 

1. If L has 0 or E*, then cr(L) ^ 

2. If L has {e} or , then cr(T) ^ 

3. If L is uniquely reachable, then u{L) ^ {n — 1)". 

4- If w~^L is uniquely reachable by w G S* with 0 ^ |w| ^ n — 1, then cr{L) ^ |w| + (n — 1 — |w|)". 
Moreover, all the bounds shown in Table\^hold. 


Table 1. Upper bounds on syntactic complexity for languages with special quotients. The abbreviation 
“ur” stands for “uniquely reachable”. The a in the last column is in E. 


0 

E* 

{£} 

E+ 

(j{L) < 

if also L is ur 

if also a is ur 

V 





(n- 1)"-^ 

1 + (n - 3)"-^ 


V 




(n- 1)"-^ 

l + (n-3)"-^ 
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V 



(n- 1)"-^ 

1 + (n-4)"-^ 


V 


V 


(n- I)"-"" 

1 + (n - 4)"-^ 

V 

V 




(n- 1)"-'' 

1 + (n - 4)"-^ 

V 

V 


V 


(n- 1)"-^ 

1 + (n - 5)"-^ 

V 

V 

V 



(n- 1)"-^ 

1 + (n- 5)"-^ 

V 

V 

V 

V 

^n-4 

(n- l)"-4 

l + ln-6)^-* 


Proof. Suppose that L C E*, n ^ 1, and k{L) = n. 

1. Since a“^0 = 0 for all a G E, there are only n — 1 states in the quotient automaton with which 
one can distinguish two transformations. Hence there are at most transformations. If L 
has E*, then a~^E* = E*, for all a G E, and the same argument applies. 

2. Since a~^{e} = 0 for all a G E, L has 0 if L has {e}. Now there are two states that do 
not contribute to distinguishing among different transformations. Dually, a~^E'^ = E* for all 
a G E, and the same argument applies. 

3. If L is uniquely reachable then w~^L = L implies w = e. Thus L does not appear in the image 
of any transformation by a word in E'^, and there remain only n — 1 choices for each of the n 
states. 

4. If 111“^L is uniquely reachable, then so is x~^L for every prefix x oi w. Hence for each prefix x 
of w, x~^L appears only in one transformation, and there are 1^1 such transformations. All the 
other transformations map every quotient x~^L to y~^L, where y is not a prefix of w. Therefore 
there can be at most (n — 1 — |ry|)" other transformations. 

The remaining entries in Table [T] are easily verified: every transformation fixes 0, E*, maps {e} 
to 0, and maps E~^ to E*, so these quotients are removed from counting possible mappings for a 
quotient. 
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4 Right Ideals and Prefix-Closed Languages 

In this section we prove that the syntactic complexity of right ideals is First we define a 

witness DFA that meets this bound. 

Definition 4 (Witness: Right Ideals). For n ^ S, let W„ = {Qn, ^w, 0, {n— 1}), be the DFA 
in which E = {a,b,c,d}, a: (0,..., n — 2), b: (0,1)> c- {n — 2 —>■ 0), and d: (n — 2—— 1). For 
n = 3 inputs a and b induce the same transformation; hence E = {a,c,d} suffices. Furthermore, 
let yV 2 = (< 52 , 0 , {!})> where a: (0 —>■ 1 ), and b: 1 , and letWi = (Qi, {a}) 0 , { 0 }); 

where a: 1 . Let L„ = LfWn)- 

The structure of the DFA of Definition 2] is shown in Fig. [2] for u ^ 3. 


CjdjjCjd b,c,d h a,b,c,d 



Fig. 2. Quotient DFA W„ of a right ideal with n" ^ transformations. 


Lemma 5. The DFA Wn of Definition is minimal, accepts a right ideal, and has transition 
semigroup of size 

Proof. If u ^ 2 this is easily verified; here Li = E* and L 2 = E*aE*. 

For n ^ 3, state q with O^q^n — 2is non-final and accepts and no other such state 

accepts this word. Since n — 1 is final, all states are distinguishable. Since Wn has exactly one final 
state and that state accepts E*, Ln is a right ideal. 

For the syntactic complexity, observe that inputs a, b, and c restricted to Qn-i can induce any 
transformation of Qn-i (Proposition [T]); hence all (n — 1)”“^ transformations that fix n — 1 can 
be performed by Wn. Also observe that any transformation (g —>■ n — 1 ) for g G {0,..., n — 3} is 
induced by a’"- 2 -'?da'?+^ 

Note that every transformation from the transition semigroup of W„ fixes state n — 1 . Let t be 
any transformation such that (n — l)t = n — 1. There are (n — 1)" such transformations, and we 
will show that all of them are generated. Let {pi,... ,pk} be the set of all states from Q \ {n — 1} 
that are mapped by f to n — 1. Then t can be generated by (pi —>■ n — 1) • • • {pk —t n — l)t', where 
t' fixes n — 1 and all states pi, and acts as t on the other states; thus it is a transformation of Qn-i 
if restricted to Qn-i and can be generated by a, b, and c. 

We are now in a position to state our main theorem of this section. 

Theorem 6 (Right Ideals and Prefix-Closed Languages). Suppose that L C E* and k{L) = 
n. If L is a right ideal or a prefix-closed language, then cr{L) ^ n"“^. This bound is tight for n = 1 
if lAI ^ 1, for n = 2 if lAj ^ 2, for n = 3 if lAI ^ 3, and for n'^ A if | Al 4. Moreover, the sizes 
of the alphabet cannot be reduced. 
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Proof. If L is a right ideal, it has S* as a quotient. By Theorem [3l a{Ln) < By Lemma [5] 

the languages of Definition |4] meet this bound. 

It is easy to verify that the alphabet cannot be smaller if n ^ 3. Let n ^ 4. The transition 
semigroup restricted to Qn-i contains all transformations Qn-i —>■ Qn-i- From Proposition [T] 
there must be three generators of these transformations, say a, b, c. They cannot map any state 
from Qn-i to n — I. Thus we need one more generator, say d, which maps a state from Qn-i to 
n — 1. 

Since prefix-closed languages are complements of right ideals and the syntactic complexity is 
preserved by complementation, the result is the same for prefix-closed languages. 

Remark 1. A maximal transition semigroup of the quotient DFA of a right ideal contains all trans¬ 
formations of Qn that fix state n — I. Hence there is only one maximal transition semigroup for 
right ideals. 

5 Left Ideals and Suffix-Closed Languages 

5.1 Basic Properties 

Let Qn = {0,..., n— I}, let = (Qn, Ax), fe, 0, F) be a minimal DFA, and let T„ be its transition 
semigroup. Consider the sequence (0, Of, Ot^,...) of states obtained by applying transformation 
t G Tn repeatedly, starting with the initial state. Since Qn is finite, there must eventually be a 
repeated state, that is, there must exist i and j such that 0, Of,..., OP, ..., are distinct, 
but OP = OP; the integer j — i is the period of t. If the period is 1, t is said to be initially aperiodic; 
then the sequence is 0, Of,..., = OP. 

Lemma 8. If is the quotient DFA of a left ideal, all the transformations in Tn are initially 
aperiodic, and no state ofVn is empty. 

Proof. Suppose that w induces a transformation t such that pi = OP = OP = pj for some i < j, 
where j — i ^ 2; thus t is not initially aperiodic. Since Dn is minimal, states pi and Pj-i must be 
distinguishable, say by word x G S*. If w^x G L, then w^~^x = w'^'w^~^~^x = w^~'^~^{w^x) ^ L, 
contradicting the assumption that L is a left ideal. If w^~^x G L, then w^x = w(w^~^x) ^ L, again 
contradicting that L is a left ideal. 

For the second claim, we know that a left ideal is non-empty by definition. So suppose that 
w G L.li L has the empty quotient, say x~^L = 0, then xw ^ L, which is a contradiction. 

Example 9. Note that the conditions of Lemma |S] are not sufficient. For S = {a, &}, the language 
L = bU E*a satisfies the conditions, but is not a left ideal because b G L but ab ^ L. Its quotient 
automaton is shown in Fig. [3] 

If the final state is 2 instead of I, the language becomes L' = SS*b = S*Sb, which is a left 
ideal. The languages L and L' have the same syntactic semigroup, but one is a left ideal while the 
other is not. 


Remark 10 language L C is a left ideal if and only if for all x,y G S*, y C (xy) ^L. 

Hence, if x~^L L, then L C x~^L for any x G . 
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a 


b 



a 

Fig. 3. Quotient DFA of a language that is not a left ideal. 


It is useful to restate this observation it terms of the states of Vn- For DFA Vn and states 
p,q G Qm we write p < q \i Kp G Kq. 

Remark 11. A DFA Vn is a minimal DFA of a left ideal if and only if for all s, t £ TjjUll}, Of ^ Ost. 
If Ot 7 ^ 0, then 0 A Of for any t G Tn- Also, if r £ has a t-predecessor, that is, if there exists 
q G Qn such that qt = r, then Ot ^ r. (This follows because g = Os for some transformation s since 
q is reachable from 0; hence 0 A q and Ot ^ qt = r.) In particular, if r appears in a cycle of t or is 
a hxed point of t, then Ot A r. 

We consider chains of the form ATij C C • • • C where the Ki^ are quotients of L. If 
L is a left ideal, the smallest element of any maximal-length chain is always L. Alternatively, we 
consider chains of states starting from 0 and strictly ordered by Since C is a partial order on 
the quotients , by dehnition of A we have the following: 

Proposition 12. For t £ T„ and p,q G Qn, P ^ q implies pt A qt. If p < pt, then p < pt < ■ ■ ■ < 
pt^ = pt^^^ for some k ^ 1. Similarly, p > q implies pt ^ qt, and p pt implies p pt >- ■ ■ ■ >- 
pt^ = pt^'^^ for some k ^ 1. 



5.2 Lower Bound 

We now show that the syntactic complexity of the following DFA of a left ideal is + n — 1. 

Definition 13 (Witness: Left Ideals). For n ^ 3, let Wn = {Qn,Idw,Sw,0,{n — 1}), be the 
DFA in which Ew — {a, b, c, d, e}, a: (1,..., n — 1), b: (1,2), c: (u — 1 —>■ 1), d: (n — 1 —>■ 0), 
and e: {Qn —t !)• For n = 3, a and b coincide, and we can use Aw = {a,c,d,e}. Also, let W 2 = 
{Q 2 , {a, b, c}, Sw, 0, {!}), where a: (0 —)> 1), 6: 1 , and c: {Q 2 —t 1), and let Wi = {Qi, {a}, <5, 0, {0}), 
where a: 1 . Let Ln = L{Wn)- 

The structure of the DFA of Definition IT^ is shown in Fig. |4] for n ^ 3. 

Lemma 14. The DFA of Definition \lS\ is minimal, accepts a left ideal, and has transition semigroup 
of size -|- n — 1 that contains all transformations fixing 0 and all the constant transformations. 

Proof. State 0 does not accept a* for any i, whereas state i with 1 < i < n — 2 accepts 

and no other state of this type accepts this word. Since n — 1 is the only final state, all states are 

distinguishable. 

To prove that L is a left ideal it suffices to show that for any w G L, we also have xw G L 
for every x G S. This is obvious if a: £ A \ {e}. li w G L, then w has the form w = uev, where 
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e 



Fig. 4. Quotient DFA Wn of a left ideal with n" ^ + n — 1 transformations. 


(5w(0, u) = 0, ^vv(0,Me) = 1, and v is accepted from state 1. But dw{0,eue) = 1, and since v is 
accepted from 1, we have euev = ew G Thus is a left ideal. 

In Wn, the transformations induced by a, 6, and c restricted to Qn \ {0} generate all the 
transformations of the last n — 1 states (Proposition[T]). Together with the transformation of d, they 
generate all transformations of Qn that fix 0, and the number of such transformations is n"“^. To 
see this, consider any transformation t that fixes 0. If some states from {1,..., n — 1} are mapped 
to 0 by t, we can map them first to n — 1 and n — 1 to one of them by the transformations of a, 6, 
and c, and then map n — 1 to 0 by the transformation of d. 

Also the words of the form ea* for i G {0, ..., n—2} induce constant transformations {Qn —t *+!)■ 
Hence the transition semigroup of contains all the constant transformations of Qn (where 
{Qn —t 0) has been already counted before). Altogether, there are +n — 1 transformations in 
the transition semigroup of W„. 

Example 15. One verifies that the maximal-length chains of quotients in W„ have length 2. On 
the other hand, for n ^ 2, let if = {a, &} and let L = Then L has n quotients and the 

maximal-length chains are of length n. 

We will see that the maximal length of chains of quotients is an important structural feature; 
in particular, to meet the bound for syntactic complexity by both left and two-sided ideals, the 
maximal length of the chains must be the smallest possible. 


5.3 Upper Bound 

The derivation of the upper bound -1- n — I for left ideals is much more difficult that that 

for right ideals. Our approach is as follows: We consider a minimal DFA = (Q^, Ux), du, 0, U) 
of an arbitrary left ideal with n quotients and let T„ be the transition semigroup of We also 
deal with the witness DFA >V„ = {Qn, 0, {n — I}) of Definition [T3l that has the same state 

set as and whose transition semigroup is Sn- We shall show that there is an injective mapping 
f: Tn ^ Sn, and this will prove that |T„| ^ j^nj. 
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Remark 16. n = \, the only left ideal is S* and the transition semigroup of its minimal DFA 
satisfies the bound 1° + 1 — 1 = 1. If n = 2, there are only three allowed transformations, since 
the transposition (0,1) is not initially aperiodic and is ruled out by Lemma [51 Thus the bound 
+ 2 - 1 = 3 holds. 

Lemma 17. // n ^ 3 and a maximal-length chain in 'Dn strictly ordered by -< has length 2, then 
\Tn\ ^ + n — 1 and T„ is a subsemigroup of Sn- 

Proof. Consider an arbitrary transformation t G and let p = Of. If p = 0, then any state other 
than 0 can possibly be mapped by t to any one of the n states; hence there are at most such 
transformations. All of these transformations are in Sn by the proof of Lemma [HI 

If p yf 0, then 0 ^ p. Consider any state q ^ {0,p}; by Remark (TTl p ^ gf. If p yf qt, then 
p ^ qt. But then we have the chain 0 ^ p ^ gf of length 3, contradicting our assumption. Hence 
we must have p = qt, and so t is the constant transformation t = {Qn —f p). Since p can be any 
one of the n — I states other than 0, we have at most n — I such transformations. Since all of these 
transformations are in Sn by Lemma 1141 T„ is a subsemigroup of Sn • 

Lemma 18 (Left Ideals, SufRx-Closed Languages). If n ^ 3 and L is a left ideal or a suffix- 
closed language with n quotients, then its syntaetic eomplexity is less than or equal to +n — 1. 

Proof. It suffices to prove the result for left ideals, since sufHx-closed languages are their comple¬ 
ments. 

For a transformation f G T„, consider the following cases: 

Case 1: t G Sn- 

Let f{t) = t; obviously /(f) is injective. 

Case 2: t ^ Sn and Of^ Of. 

Note that t ^ Sn implies Of yf 0 by Lemma [T4l Let Of = p. We have p = Of ^ Off = pf by Remark ITT] 
Let p ^ ^ pf*^ = pf^+^ be the chain defined from p; this chain is of length at least 2. Let /(f) = s, 

where s is the transformation defined by 

Os = 0, pf^s = p, qs = qt for the other states qG Qn- 

Transformation s is shown in Fig. [SI where the dashed transitions show how s differs from f. 

By Lemma [Ml s G Sn. However, s ^ T„, as it contains the cycle (p,... ,pf*) with states strictly 
ordered by ^ in DFA which contradicts Proposition [121 Since s ^ T„, it is distinct from the 
transformations defined in Case 1. 

In going from f to s, we have added one transition (Os = 0) that is a fixed point, and one 
(pf^s = p) that is not. Since only one non-fixed-point transition has been added, there can be only 
one cycle in s with states strictly ordered by ^. Since 0 cannot appear in this cycle, p is its smallest 
element with respect to 

Suppose now that f' ^ f is another transformation that satisfies Case 2, that is, Of' = p' 0 and 
p'f' p'; we shall show that /(f) ^ /(f')- Define s' for f' as s was defined for f. For a contradiction, 
assume s = f(t) = f{t') = s'. 

Like s, s' contains only one cycle strictly ordered by and p' is its smallest element. Since we 
have assumed that s = s', we must havep = Of = Of' = p' and the cycles in s and s' must be identical. 
In particular, pf'^f = pf^ = p{t'Yt' = p{t')^. For q oi Qn \ {0,pf^}, we have qt = qs = qs' = qt'. 
Hence f = f'—a contradiction. Therefore f ^ f' implies /(f) /(f'). 
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t: 








Fig. 5. Case 2 in the proof of Lemma [TSl 


Case 3 : t ^ Sn and = Of. 

As before, let Of = p. Consider any state q ^ {0,p}; then 0 ^ q by Remark [TT] and Of ^ qt by 
Proposition [TH Thus either p qt, or p = qt. We consider the following sub-cases: 

• (a): f has a cycle. 

Since f has a cycle, take a state r from the cycle; then r and rf are not comparable under ^ by 
Proposition 1121 and p ^ r hy Remark 1111 Let /(f) = s, where s is the transformation shown in 
Figure [6] and defined by 

Os = 0, ps = r, qs = qt for the other states q € Qn- 



Fig. 6. Case 3(a) in the proof of Lemma ITSl 


By Lemma [Ml s € Sn- Suppose that s € Tn', since p ^ r, we have r = ps < rs = rt hy the 
definition of s and Proposition [T^l this contradicts that r and rf are not comparable. Hence s ^ T^, 
and so s is distinct from the transformations of Case 1. 
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We claim that p is not in a cycle of s; this cycle would have to be 


P 


rt 


rt 


k-l 


■ p, that is, p ^ r 


rt 


rt 


k-l 


t 


for some k ^ 2 because r ^ p = pt and rt ^ p. Since p ^ r we have p -< rt; but then we have a 
chain p ^ rt ^ ^ rt^ = p, contradicting Proposition 1121 

Since p is not in a cycle of s, it follows that s does not contain a cycle with states strictly ordered 
by -<, as such a cycle would also be in t. So s is distinct from the transformations of Case 2. 

We claim there is a unique state q such that (a) 0 ^ g ^ qs, (b) qs qs^. First we show that 
p satisfies these conditions: (a) holds because ps = r and p <r; (b) holds because ps = r, ps^ = rt 
and r and rt are not comparable. Now suppose that q satisfies the two conditions, but q ^ p. Note 
that qs ^ p, because qs = p implies qs = p < r = qs'^, contradicting (b). Since q,qs ^ {0,p}, we 
have qt = qs qs^ = qt^. But Proposition [T^ for q < qt implies that qt ^ qt^ —a contradiction. 
Thus p is the only state satisfying these conditions. 

If t' ^ t is another transformation satisfying the conditions of this case, we define s' like s. 
Suppose that s = f{t) = f{t') = s'. Since both s and s' contain a unique state p satisfying the two 
conditions above, we have Ot = Ot' = p and pt = pt' = p. Since the other states are mapped by s 
exactly as by t and t', we have t = t'. 

•(b): t has no cycles and has a fixed point r ^ p. 

Because 0 ^ r by Remark [TTl Ot ^ rt by Proposition [121 If r is a fixed point of t, then p = Qt < 
rt = r. Since r ^ p, we have p ^ r. Let f{t) = s, where s is the transformation shown in Figure [7] 
and defined by 


Os = 0, qs = 0 for each fixed point q ^ p, qs = qt for the other states q € Qn- 

By Lemma [T4l s G Sn- Suppose that s G T„; because p < r, ps = p, rs = 0, and ps ^ rs by 
ProDOsition ll2l we havep ^ 0, which is a contradiction. Hence s is not in T„ and so is distinct from 
the transformations of Case 1. Also, s maps at least one state other than 0 to 0, and so is distinct 
from the transformations of Case 2 and also from the transformations of Case 3(a). 





Fig. 7. Case 3(b) in the proof of Lemma (TS] 
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lit' is another transformation satisfying the conditions of this case, we define s' like s. Now 
suppose that s = f{t) = f{t') = s'. There is only one fixed point of s other than 0 {ps = p), and 
only one fixed point of s' other than 0 {p's' = p'); hence Ot = p = p' = Ot'. By the definition of s, for 
each state 9 yf 0 such that qs = 0, we have qt = q. Similarly, for each state q 0 such that qs' = 0, 
we have qt' = q. Hence t and t' agree on these states. Since the remaining states are mapped by s 
exactly as they are mapped by t and t', we have t = t'. Thus we have proved that t ^ t' implies 

fit) + fit')- 

• (c): t has no cycles, has no fixed point r p, and there is a state r such that p ^ r with 
rt = p. 

Let f(t) = s, where s is the transformation shown in Figure [5] and defined by 

Os = 0, ps = r, qs = 0 for each q> p such that qt = p, 
qs = qt for the other states q € Qn- 



Fig. 8. Case 3(c) in the proof of Lemma fTSl 


By LemmaHU s G Sn- Suppose that s G T„; because p < r, ps = r, rs = 0, and r = ps ^ rs = 0 
by Proposition m we have r -< 0—a contradiction. Hence s ^ and s is distinct from the 

transformations of Case 1. 

Because s maps at least one state other than 0 to 0 (rs = 0), it is distinct from the transforma¬ 
tions of Case 2 and 3(a). Also s does not have a fixed point other than 0, while the transformations 
of Case 3(b) have such a fixed point. 

We claim that there is a unique state q such that {a) 0 < q < qs and (b) qs^ = 0. First we show 
that p satisfies these conditions. By assumption 0 ^ p ^ r and rt = p; also rs = 0 by the definition 
of s. Condition (a) holds because 0 ^ p ^ r = ps, and (b) holds because 0 = rs = ps^. 

Now suppose that 0 < q < qs, qs'^ = 0 and q ^ p. Since qs ^ 0, we have qs = qt by the definition 
of s. Because qt has a t-predecessor, p ^ gt by Remark [TTJ Also qt = qs ^ p, for qs = p implies 
0 = qs^ = ps = r —a contradiction. Hence p ^ qt. From qt = qs and q < qs, we have q < qt. Since 

= 0 we have iqt)s = 0 and so {qt)t = p, by the definition of s. By Proposition[121 from q qt 
we have qt ^ iqt)t = p, contradicting p ^ qt. So q = p. 
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If <' yf t is another transformation satisfying the conditions of this case, we define s' like s. 
Suppose that s = f{t) = t(t') = s'. Since s and s' contain a unique state p satisfying the two 
conditions above, we have Ot = Of = p and pt = pf = p. Then r and the states q y p with qt = p 
are determined by p, since they are precisely the states q >- p with qs = 0. Since the other states 
are mapped by s exactly as by t and f, we have t = f, and / is again injective. 

• All cases are covered: 

See the appendix for a list of the cases. We need to ensure that any transformation t fits in at least 
one case. It is clear that t fits in Case I or 2 or 3. For Case 3, it is sufficient to show that if (i) 
t ^ Sn does not contain a fixed point r ^ p, and (ii) there is no state r with p ^ r and rt = p, then 
t contains a cycle and so fits in Subcase 3(c). 

First, if there is no r such that p < r, we claim that t is the constant transformation (Qn —>■ p), 
thus it fits in Case 1. Consider any state q G Qn such that qt yf p. Then p ^ qt by Remark ITTl 
contradicting that there is no state r = qt such that p ^ r. 

So let t be a transformation that fits in Case 3 and satisfies (i) and (ii), and let r be some 
state such that p < r. Consider the sequence r,rt,rt^, .... By Remark [TTl p ^ rf for all i ^ 0. If 
rt^ = p for some fc ^ I, let k be the smallest such number, then rt^~^ y^ p; we have p ^ rt^~^ and 
{rt^~^)t = p, contradicting (ii). Since p is the only fixed point by (i), we have rf y^ rf~^ for all 
i ^ 1. Since there are finitely many states, rf = rf for some i and j such that 0 ^ i < j — 1, and 
so the states rf, rf~^^,... ,rf = rf form a cycle. 

We have shown that for every transformation t in T„ there is a corresponding transformation 
f{t) in Sn, and / is injective. So |T„| < \Sn\ = + n — 1. 

Next we prove that is the only transition semigroup meeting the bound. It follows that 
minimal DFAs of left ideals with the maximal syntactic complexity have maximal-length chains of 
length 2 . 

Theorem 19. If Tn has size + n — 1, then Tn = Sn- 

Proof. Consider a maximal-length chain of states strictly ordered by ^ in If its length is 2, 
then by Lemma [13 is a subsemigroup of Sn- Thus only Tn = Sn reaches the bound in this case. 

Assume now that the length of a maximal-length chain is at least 3. Then there are states p 
and r such that 0 ^ p ^ r. Let R = {q \ p q}, and let X = \ (i? U {0,p}). We shall show 

that there exists a transformation s that is in Sn but not in f{Tn). To define s we use the constant 
transformation u = {Qn —t p) as an auxiliary transformation. Let Os = 0, ps = r, rs = 0 for all 
r G R, and qs = qu = p ioi q G X; these are precisely the rules we used in Case 3(c) in the proof 
of Lemma [HI By Lemma [Ml s G Sn- 

It remains to be shown that there is no transformation t G Tn such that s = f{t). The proof 
that s is different from the transformations f{t) of Cases I, 2, 3(a) and 3(b) is exactly the same as 
the corresponding proof in Case 3(c) following the definition of s. 

It remains to verify that there is no u' G T„ in Case 3(c) such that f{u') = s. Suppose there is 
such a u'. Recall that states p and r satisfying 0 ^ p ^ r have been fixed by assumption. By the 
definition of s, state p satisfies the conditions (a) 0 ^ p ^ ps and (b) ps^ = 0. We claim that p is 
the only state satisfying these conditions. Indeed, if q y^ p then either qs = 0, q 7 ^ qs = 0 and (a) is 
violated, or qs = p, qs^ = ps = r y^ 0 and (b) is violated. This observation is used in the proof of 
Case 3(c) to prove the claim below. 

Both u and u' satisfy the conditions of Case 3(c), except that u fails the condition u ^ Sn- 
However, that latter condition is not used in the proof that if u y^ u' and u' satisfy the other 
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conditions of Case 3(c), then s' ^ s, where s' is the transformation obtained from u' by the rules 
of s. Thus s is also different from the transformations in f{Tn) from Case 3(c). 

Because s ^ f(Tn), s G Sn and /(T„) C 5'„, the bound n"“^ + n — 1 cannot be reached if the 
length of the maximal-length chains is not 2 . 

Proposition 20. For n ^ 4, the minimal number of generators of the transition semigroup Tn is 
5. 

Proof. Since all transformations mapping 0 to a state in (5n\{0} are constant transformations, they 
must be generated by constant generators. Let e be one of these generators. Transition semigroup 
Tn contains all transformations from Qn \ {0} to Qn \ {0} that fix 0. By Proposition [T] we need 
three generators for them, and the generators must fix 0 ; otherwise a generator would have to be 
constant, and the only constant transformation fixing 0 is {Qn 0). Let a,b,c be three of these 
generators. Finally, Tn contains transformations mapping some states from Qn \ {0} to 0, so we 
need one more generator d mapping a state other than 0 to 0 . 

We are finally in a position to prove our main theorem of this section. 

Theorem 21 (Left Ideals, Suffix-Closed Languages). Suppose that L C E* and k{L) = n. If 
L is a left ideal or a suffix-closed language, then <j{L) ^ -|- n — 1. This bound is tight for n = 1 

if |L’| ^ 1, for n = 2 if |T’| ^ 3, for n = 3 if |L’| ^ 4, and for n ^ A if 1^71 ^ 5. Moreover, the sizes 
of the alphabet cannot be reduced. 

Proof. If L is a left ideal, then a{Ln) ^ -|-n — 1 by Lemma [T51 By Lemma [HI the languages of 

Definition m meet this bound. It is easy to verify that the size of the alphabet cannot be reduced 
if n ^ 3. For u ^ 4, by Theorem 1191 only languages L whose quotient automaton has transition 
semigroup isomorphic to Tn meet the bound, and by Proposition [20] T„ requires 5 generators. 


6 Two-Sided Ideals 

If a language L is a right ideal, then L = LE* and L has exactly one final quotient, namely E*] 
hence this also holds for two-sided ideals. For n ^ 3, in a two-sided ideal every maximal chain is of 
length at least 3: it starts with L, every quotient contains L and is contained in E*. 


6.1 Lower Bound 

We now show that the syntactic complexity of the following DFA of a two-sided ideal is -|- 
(n - 2)2"-2-h 1. 

Definition 22 (Witness: Two-Sided Ideals). Porn ^ 4, define the DFA >V„ = {Qm L’w, (5w; 0, { 
1 }), where Ey^ = {o-^b,c,d,e, f}, a: ( 1 , ..., n — 2 ), b: ( 1 , 2 ), c: {n — 2 —>■ 1 ), d: {n — 2 —>■ 0), 
e: Qn-i —>■ 1, and /: (1 —>■ n — 1). For n = 4, inputs a and b coincide, and we can use Ey^ = 
{a, c, d, e, /}. Also, let Wa = (Qa, {a, b, c}, 0, {2}), where a: (1 —>■ 2)(0 —>■ 1), b: (1 —>■ 0), and 

c: 1, and let >V 2 = {Q 2 , {a, b}, Syy, 0, {!}), where a: (0 —>■ 1), and b: 1, Let Ln = LfWn). 

The structure of the DFA of Definition is shown in Fig. [HI for n ^ 4. 
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a, b, c, d, e, f 



Fig. 9. Quotient DFA of a two-sided ideal with n" ^ (n — 2)2" ^ -I- 1 transformations. 


Lemma 23. For n ^ 2, the DFA of Definition, is minimal, accepts a two-sided ideal, and its 
transition semigroup has size (n — 2 ) 2 "“^ -|- 1 . 

Proof. For i = 1,... ,n — 2, state i is the only non-final state that accepts hence all these 

states are distinguishable. State 0 is distinguishable from these states, because it does not accept 
any words in a* f. Hence Wn is minimal. The proof that Wn is a left ideal is like that in Lemma (TT] 
Since n — 1 is the only final state, is a right ideal. Hence it is two-sided. 

For n = 3, Wa meets the bound 6 with the transition semigroup consisting of the transformations 
[ 0 , 1 , 2 ], [ 1 , 2 , 2 ], [ 2 , 2 , 2 ], [ 0 , 0 , 2 ], [ 1 , 1 , 2 ], and [ 0 , 2 , 2 ]. Also, for n = 2, W 2 meets the bound 2 . 

From now on we may assume that n ^ 4. In W„, the transformations induced by a, b, and c 
restricted to Qn \ {0,n — 1} generate all the transformations of the states 1,... ,n — 2. Together 
with the transformations of d and /, they generate all transformations of Qn that hx 0 and 
n — 1. For any subset S C {1, ..., n — 2}, there is a transformation—induced by a word ws, say— 
that maps S' to n — 1 and fixes Qn \ S. Then the words of the form wsea^, for i S {0,..., n — 3}, 
induce all transformations that map SU{n—l}ton— 1 and \ (SU{n — 1}) to i -\- 1. There 
are 2 "“^ such transformations, and for each such transformation there are n — 2 possibilities for i. 
Hence there are (n — 2)2”“^ transformations of this type. There is also the constant transformation 
e/: {Qn n — 1), which yields the total number claimed. 

6.2 Upper Bound 

We consider a minimal DFA = {Qn, S-d, dT>,0, {n — 1}) of an arbitrary two-sided ideal with 
n quotients, and let be the transition semigroup of We also deal with the witness DFA 
FVn = {Qn, ^w, <^W) 0, {n — 1 }) of Definition [ 22 l with transition semigroup 5'„. 

Lemma 24. If n ^ 4 and a maximal-length chain in Dn strictly ordered by -< has length 3, then 
|T„j ^ -I- (n — 2)2"“^ -|- 1, and T„ is a subsemigroup of Sn- 
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Proof. Consider an arbitrary transformation t G Tn', then (n — l)t = n — 1. If Ot = 0, then any state 
not in {0, n — 1} can possibly be mapped by t to any one of the n states; hence there are at most 
such transformations. 

If Qt 7 ^ 0, then 0 ^ Ot. Consider any state q ^ {0, Of}; since is minimal, q must be reachable 
from 0 by some transformation s, that is, q = Os. If Ost ^ {Ot, n — I}, then Ot ^ Ost by Remark fTTl 
But then we have the chain 0 ^ Ot ^ Ost ^ n — 1 of length 4, contradicting our assumption. 
Hence we must have either Ost = Ot, or Ost = n — 1. For a fixed Ot, a subset of the states in 
Qn \ {0,n — 1} can be mapped to Ot and the remaining states in \ {0,n — 1} to n — I, thus 
giving 2”“^ transformations. Since there are n — 2 possibilities for Ot, we obtain the second part of 
the bound. Finally, all states can be mapped to n — 1. 

By Lemma [23] all of the above-mentioned transformations are in Sn- 

Lemma 25 (Two-Sided Ideals, Factor-Closed Languages). If L is a two-sided ideal or a 
factor-elosed language with n ^ 2 quotients, then its syntaetic complexity is less than or equal to 
n"-2-k(n-2)2"-2-hl. 

Proof. It suffices to prove the result for two-sided ideals, since factor-closed languages are their 
complements. 

If n = 1, the only two-sided ideal is E*, its syntactic complexity is 1, and so the upper bound 
is 1. If n = 2, each two-sided ideal is of the form L = E*rE*, where 0 C F C 27, its syntactic 
complexity is 2, and so the upper bound is 2, and this agrees with Lemma [23l If n = 3, there are 
eight transformations that are initially aperiodic and such that (n — l)t = n — 1 (the property of 
a right-ideal transformation). We have verified that the DFA having all eight or any seven of the 
eight transformations is not a two-sided ideal. Hence 6 is an upper bound. From now on we may 
assume that n ^ 4. 

As we did for left ideals, we show that |r„| ^ |S'„|, by constructing an injective function /: T„ —>• 
Sn. 

We have q ^ n — 1 for all q G Qn, and n — 1 is a fixed point of every transformation in T„ and 
Sn. 

For a transformation t G Tn, consider the following cases: 

Case 1: t G Sn. 

The proof is the same as that of Case 1 of Lemma [T51 
Case 2: t ^ Sn, and Ot^ ^ Ot. 

Let Ot = p pt^ = pt^^^ be the chain defined from p. 

• (a): pt^ ^ n — 1. 

The proof is the same as that of Case 2 of Lemma [181 

• (b): pt^ = n—1 and k ^ 2. 

Let f{t) = s, where s is the transformation shown in Fig. 1101 and defined by 

Os = 0, pPs = pP~^ for 1 ^ z < /c — 1, ps = n — 1, 
qs = qt for the other states q G Qn. 

By Lemma s G Sn. We have pt >- p, pts = p, and ps = n — 1. By Proposition [T^ pts Y ps, that 
is, p F n — 1, which contradicts the fact that p ^ n — 1 (since fc ^ 2), and q — 1 for all q G Qn. 
Thus s is not in T„, and so it is different from the transformations of Case 1. 
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Fig. 10. Case 2(b) in the proof of Lemma [25] 


Observe that s does not have a cycle with states strictly ordered by -<, since no state from 
{0,p,pt,... ,pt^~^} can be in a cycle, and t cannot have such a cycle. Hence s is different from the 
transformations of Case 2(a). 

In s, there is a unique state q such that qs = n — 1 and for which there exists a state r such 
that r q and rs = q, and that this state q must be p. Indeed, ii q ^ p, then qt = qs = n — 1 hy 
the definition of s. From r q, we have rt > qt = n — hence rs = rt = n — 1 and rt ^ q —a 
contradiction. Hence q = p. 

By a similar argument, we show that there exists a unique state q such that q > p, and qs = p, 
and that this state q must be pt. li q ^ pt then qs = qt. But q >- qt and p = qt qt^ = pt 
contradicts that p -< pt. Continuing in this way for ... ^pt^~^ we show that there is a unique 
chain pt'^~^ A- ■■■ A- pt A- p. 

lit'^t is another transformation satisfying the conditions of this case, we define s' like s. Now 
suppose that s = f(t) = f(t') = s'. Since we have a unique state p such that ps = n — 1 for which 
there exists a state r such that r >- p and rs = p, we have Ot = Ot' = p. Also the chain of states 
p,pt,pt^,... ,pA~^ is unique in s and s' as we have shown above; so pt* = pt'' for t = 1,..., fc — I. 
Since the other states are mapped by s exactly as by t and t', we have t = t'. 

• (c): pt = n — 1. 

Let P = {0,p, n — 1}. Since n ^ 4, there must be a state r ^ P. If p ^ r for all r ^ P, then 
n — 1 = pt <rt] hence rt = n — I for all such r, and qt S {p, n — 1} for all q G By Lemma (231 
there is a transformation in Sn that maps S' U {n — 1} to n — I, and Qn \ (S U {n — I}) to p for any 
S C {l,...,n — 2}. Thus t G S„—a contradiction. 

In view of the above, there must exist a state r ^ P such that p r. By Remark 1111 we have 
p ^ rt and of course rt ^ n — 1. If rt is p or n — 1 for all r ^ P, we again have the situation described 
above, showing that t G S„. Hence there must exist an r ^ P such that p r and p ^ rt ^ n — 1. 

Also we claim that t does not have a cycle. Indeed, if p ^ g, then q is mapped to n — 1; if p g, 
then q is mapped to a state qt ^ p and again q cannot be in a cycle since the chain starting with q 
ends in n — 1. 

Let /(t) = s, where s is the transformation shown in Fig. 1111 and defined by 


Os = 0, ps = rt, {rt)s = p, rs = 0, 
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qs = qt for the other states q € Qn- 



Fig. 11. Case 2(c) in the proof of Lemma [25l 


Since s fixes both 0 and n — 1, it is in iSn by Lemma [23l But s is not in T„, as we have the cycle 
(p, rt) with p -< rt. So s is different from the transformations of Case 1. Since s maps a state other 
than 0 to 0, it is different from the transformations of Cases 2(a) and 2(b). 

Observe that t does not map any state to 0; otherwise, it qt = 0 for some q, then 0 p implies 
q ^ 0 by Proposition [T21 which contradicts that 0 ^ q from Remark [TTJ since q is reachable from 0 
by some transformation. Consequently, in s there is the unique state r 7 ^ 0 mapped to 0. Also, as t 
does not contain a cycle, the only cycle in s must be {p,rt). 

lit' ^ t is another transformation satisfying the conditions of this case, we define s' like s. Now 
suppose that s = f{t) = f(t') = s'. Because both s and s' have the unique non-fixed point r mapped 
to 0, r = r'. Also s and s' contain the unique cycle (p, rt), p ^ rt. Thus p = p', pt = pt' = n — 1 
and rt = rt'. It follows that Ot = Ot' = p. Because p < rt = rt', we have {rt)t = {rt)t' = n — 1. The 
other states are mapped by s exactly as by t and t', and so t = t'. 

Case 3: t ^ Sn, Ot = p ^ 0, and pt = p. 

• (a): t has a cycle. 

The proof is analogous to that of Case 3(a) in Lemma fTSl but we need to ensure that s is different 
from the s of Cases 2(b) and 2(c). 

Here there is the state r such that r ^ rs, and rs and rs^ are not comparable under Consider 
a transformation t' that fits in Case 2(b). Then in s' every state q = pf for 0 ^ ^ fc — 1, and 

q = 0 , is mapped to a state comparable with q under and the other states are mapped as in t'. 
Since t' G T„ cannot contain a state r' such that r' -< r't and r't and r't^ are not comparable under 
it follows that s' also does not contain such a state. Thus s 7 ^ s'. 

For a distinction from the transformations of Case 2(c) observe that s does not map to 0 any 
state other than 0 . 
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. (b): t has no cycles and has a fixed point r ^ {p, n — 1}. 

The proof is analogous to that of Case 3(b) in Lemma [TH but we need to ensure that s is different 
from the s of Cases 2(b) and 2(c). 

Since s maps to 0 a state other than 0, this case is distinct from Case 2(b). Because t does not 
have a cycle, and no state q mapped to 0 can be in a cycle in s, it follows that s does not have a 
cycle. Thus s is different from the transformations of Case 2(c). 

• (c): t has neither a cycle nor a fixed point r ^ {p, n — 1}, and has a state r p mapped to p. 
The proof is analogous to that of Case 3(c) in Lemma [T51 but we need to ensure that s is different 
from the s of Cases 2(b) and 2(c). 

As before, since s maps to 0 a state other than 0, this case is distinct from Case 2(b). In s, 0 
cannot be in a cycle, no state q>- p mapped to 0 can be in a cycle and p cannot be in a cycle as 
ps = r and rs = 0. Since the other states are mapped as in t, s does not have a cycle. Thus s is 
different from the transformations of Case 2(c). 

. (d): t has no cycles, no fixed point r ^ {p, n — 1}, and no state r p mapped to p, and has a 
state r such that p ^ r ^ n — 1 that is mapped to u — 1. 

Let f{t) = s, where s is the transformation shown in Fig. [12] and defined by 

Os = 0, qs = q for states q such that qt = n — 1, ps = n — 1 qs = qt for the other states q G 



Fig. 12. Case 3(d) in the proof of Lemma [25] 


By Lemma E51 s G S'„. However, s is not in Tn, as we have a fixed point r such that p <r < n— 1 
and ps = n — 1. So Proposition 1121 yields n — 1 = ps ^ rs = r—a contradiction. Thus s is different 
from the transformations of Case 1. 

Transformation s does not have any cycles, as t does not have one in this case and fixed points q 
and p cannot be in a cycle. So s is different from the transformations of Cases 2(a) and 3(a). Also, 
since p is the unique state mapped to n — 1 and there is no state r >- p mapped to p, s is different 
from the transformations of Case 2(b). For a distinction from the transformations of Cases 2(c), 
3(b) and 3(c), observe that s does not map to 0 any state other than 0. 
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lit' is another transformation satisfying the conditions of this case, we define s' like s. Now 
suppose that s = f{t) = f{t') = s'. Observe that t does not have a fixed point other than n — 1. So 
for every fixed point q ^ {0,n — 1} of s we have qt = qt' = n — 1. Also, since p is the unique state 
mapped to n — 1 in s, Ot = Of = p and pt = pf = p. The other states are mapped by s as by t and 
t'; so t = f. 

• All cases are covered: 

See the appendix for a list of the cases. We need to ensure that any transformation t fits in at least 
one case. It is clear that t fits in Case 1 or 2 or 3. Any transformation from Case 2 fits in Case 2(a) 
or 2(b) or 2(c). For Case 3, it is sufficient to show that if (i) t ^ Sn does not contain a fixed point 
r ^ {p, n — 1 }, and (ii) there is no state r,p<r^n—l, mapped to p or n — 1 , then t has a cycle. 

If there is no state r such that p ^ r ^ n — 1, then gt S {p, n — 1} for all q S Qn, since qt Y p. 
By the proof of Lemma [23] in Sn for any S C \ {n — 1} there are all transformations that map 
S' U {n — 1} to n — 1, and the other states Qn \ (S U {n — 1}) to any state from Qn] thus t S Sn —a 
contradiction. 

So let t be a transformation that fits in Case 3 and satisfies (i) and (ii), and let r be some state 
such that p < r < n — 1. Consider the sequence r, rt, rt^,.... By Remark [TTl p ^ rf for all * ^ 0. 
If rt^ G {p,n — 1} for some fc ^ 1, let k be the smallest such number, then rt^~"^ ^ {p^n — 1}; we 
have p ^ rt^~^ -< n — I and {rt'^~^)t G {p, n — 1 }, contradicting (ii). 

Sincepandn—1 are the only fixed points by (i), we have rt* rf~^. Since there are finitely many 
states, rf = rf for some i and j such that 0 ^ i < j — 1 , and so the states rf, rt *+^... ,rP = rf 
form a cycle. 

Theorem 26. //T„ has size n"“^ + (n — 2)2"’“^ + 1, then Tn = Sn- 

Proof. The proof is very similar to that of Theorem 1191 

Consider a maximal-length chain of states strictly ordered by ^ in T>„. If its length is 3, then 
by Lemma |24|T„ is a subsemigroup of Sn- Thus only T„ = Sn reaches the bound. 

If there is a chain of length 4, then there are states p and r such that O^p^r^n — 1. Let / 
be the injective function from Lemma |25l Consider the transformation u that maps Qn \{n — 1} 
to p and fixes n — 1. Let s be defined from u in Case 3(c) of the proof of Lemma (251 The rest of 
the proof follows the proof of Theorem |T9| with Case 3(d) of Lemma [25l added. 

Proposition 27. For n ^ 4, the minimal number of generators of the transition semigroup Tn is 
6 . 

Proof. From Proposition |20| we know that the transformations in Tn restricted to Qn \ {^ ~ 1} 
require 5 generators. These generators in T„ do not map any state from Qn \ {n—l}ton—1, and 
must fix n — 1. Hence, we need one more generator that map a state from Qn \{n — 1} to n — 1. 

We are now in a position to prove our main theorem of this section. 

Theorem 28 (Two-Sided Ideals, Factor-Closed Languages). Suppose that L S* and 
k{L) = n > 1. If L is a two-sided ideal or a factor-closed language, then <j{L) ^ -I- (n — 

2 ) 2 "“^ _l_ ^ This bound is tight for n = 2 */ | Aj ^ 2 , for n = 3 */ | A| ^3, for n ^ 4 if \S\ ^5, and 
for n ^ 5 */ I A| ^ 6 . Moreover, the sizes of the alphabet cannot be reduced. 

Proof. This follows from Lemmas [23] and [25] It is easy to verify that the size of the alphabet cannot 
be reduced if n ^ 4. For n 5, by Theorem only languages L whose quotient automaton 
has transition semigroup isomorphic to T„ meet the bound, and by Proposition 1271 semigroup T„ 
requires 6 generators. 
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7 Conclusions 

We have found tight upper bounds on the syntactic complexity of right, left, and two-sided ideals. 

Despite the fact that the Myhill congruence has left-right symmetry, there are significant differences 
between left and right ideals. We have shown that in each of the three cases the maximal transition 
semigroup is unique. 

In our proof for left and two-sided ideals we exhibited an injective function from the transition 
semigroup of a minimal DFA of an arbitrary left, right, two-sided ideal language to the transition 
semigroup of the witness DFA attaining the upper bound for these languages. This approach is 
generally applicable for other subclasses of regular languages. For example, in m we have used 
this method to establish the upper bound for sufHx-free languages. 
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Appendix 

List of the cases in the proof of Lemma 1181 

Case 1: t G S'„. 

Case 2: t ^ Sn and ^ Ot. 

Case 3: t ^ Sn and = Ot. 

Case 3(a): t has a cycle. 

Case 3(b): t has no cycles and has a fixed point r ^ p. 

Case 3(c): t has no cycles, has no fixed point r p, and there is a state r such that p < r 
with rt = p. 


Case 1\ s — t. Case 2; 



Fig. 13. Map of the cases in the proof of Lemma [TSi The transitions of t are represented by solid lines, and 
the transitions of s by dashed red lines. 
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List of the cases in the proof of Lemma I25L 
Case li t G Sn- 

Case 2: t ^ Sn and ^ Ot. 

Case 2(a): t ^ Sn and Ot. 

Case 2(b): = n — 1 and k ^ 2. 

Case 2(c): pt = n — 1. 

Case 3: t ^ Sn, Ot = p 0 and pt = p. 

Case 3(a): t has a cycle. 

Case 3(b): t has no cycles and has a hxed point r ^ {p, n — 1}. 

Case 3(c): t has neither a cycle nor a fixed point r ^ {p,n — 1}, and has a state r > p 
mapped to p. 


Case 1: s — t. Case 2(a) and Case 3(a): 



Case 3(d): 



Fig. 14. Map of the cases in the proof of Lemma The transitions of t are represented by solid lines, and 
the transitions of s by dashed red lines. 













